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Abstract 

Background: Massive datasets comprising high-resolution images, generated in neuro-imaging studies and in 
clinical imaging research, are increasingly challenging our ability to analyze, share, and filter such images in clinical 
and basic translational research. Pivot collection exploratory analysis provides each user the ability to fully interact 
with the massive amounts of visual data to fully facilitate sufficient sorting, flexibility and speed to fluidly access, 
explore or analyze the massive image data sets of high-resolution images and their associated meta information, 
such as neuro-imaging databases from the Allen Brain Atlas. It is used in clustering, filtering, data sharing and 
classifying of the visual data into various deep zoom levels and meta information categories to detect the 
underlying hidden pattern within the data set that has been used. 

Method: We deployed prototype Pivot collections using the Linux CentOS running on the Apache web server. We 
also tested the prototype Pivot collections on other operating systems like Windows (the most common variants) 
and UNIX, etc. It is demonstrated that the approach yields very good results when compared with other 
approaches used by some researchers for generation, creation, and clustering of massive image collections such as 
the coronal and horizontal sections of the mouse brain from the Allen Brain Atlas. 

Results: Pivot visual analytics was used to analyze a prototype of dataset Dab2 co-expressed genes from the Allen 
Brain Atlas. The metadata along with high-resolution images were automatically extracted using the Allen Brain 
Atlas API. It is then used to identify the hidden information based on the various categories and conditions applied 
by using options generated from automated collection. A metadata category like chromosome, as well as data for 
individual cases like sex, age, and plan attributes of a particular gene, is used to filter, sort and to determine if 
there exist other genes with a similar characteristics to Dab2. And online access to the mouse brain pivot 
collection can be viewed using the link http://edtech-dev.uthsc.edu/CTSI/teeDev1/unittest/PaPa/collection.html 
(user name: tviangte and password: demome) 

Conclusions: Our proposed algorithm has automated the creation of large image Pivot collections; this will enable 
investigators of clinical research projects to easily and quickly analyse the image collections through a perspective 
that is useful for making critical decisions about the image patterns discovered. 
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Background 

Recent research in laboratory science and clinical trial 
studies has given rise to the generation of massive neuro- 
images and clinical image collections of very high-resolu- 
tion. Thus in order to deliver massive collections so that 
researchers can fully interact, share, and filter image col- 
lections, an online, real-time research collaborative 
approach has become a very key challenge. With the 
recent expansion of browsing capabilities and increased 
network performance [1,2], delivering massive image col- 
lections has become feasible for translational researchers 
and clinician-scientists to analyze, interpret and even 
possibly diagnose from these distributed networked 
image collections. Given these improvements in recent 
modern web service technologies, a basic component to 
be considered when developing these distributed image 
portals for viewing massive image collections is the ability 
to efficiently interact with and effectively search large 
amounts of data to answer multi-dimensional analytical 
queries and its augmentation with pertinent experiential 
knowledge. The Microsoft Live Labs Pivot [3] provides a 
unique alternative way of viewing massive information 
and allows users to discover any recognizable patterns, 
thus offering the potential for exploring new ideas that 
can be used to test the research hypotheses and promote 
translational research. The Pivot collections use Deep- 
Zoom [4] technology and Silverlight [5] browser capabil- 
ity to display high-resolution image collections, which is 
a very fast and smooth zooming technology that provides 
user with a quick way to navigate through high-resolu- 
tion images in multiple zoom levels. 

In this article, we defined the minimum set of require- 
ments necessary to implement such an automated pro- 
cess for Pivot image collections [6,7]. The large sets of 
images were obtained from the Allen Brain Atlas [8], a 
genome-wide database resource of full colour, high-reso- 
lution gene expression patterns in the mouse brain [9]. 
The Pivot collections were deployed in a Linux CentOS 
5 2-dual core with 2.26 GHz Xeon processors (Dell 
R610) running Apache web server [10]; Graphics::DZI 
[11] is a Perl module [12] that can be used in creating 
and generating titles from a given collection of images 
but in this study Python modules [4,6,7] were used. The 
images are generated like the tree in the DeepZoom [4] 
pyramid, thus making it possible to be expanded or 
zoomed to yield high-resolution zoom levels. The Pivot 
collections were tested precisely on various operating 
systems like Windows 7, Vista and Mac OS X. In addi- 
tion to Pivot collections, this discussion will include the 
full benefits of DZI generation, which are expansions 
and implementation of analytic and statistical functions 
that will extract meaningful information from Pivot 
collections. 



A lot of effort has been expended in developing web- 
based image management systems which offer the poten- 
tial to access image collections for research clinician- 
scientists [13]. Several centers and National Institutes of 
Health (NIH) have invested heavily in individual image- 
data storage and retrieval systems. Notable among them 
are the National Center for Research Resources' "Biome- 
dical Informatics Research Network" (BIRN) and the 
National Cancer Institute's "cancer Biomedical Infor- 
matics Grid" (caBIG) [14]. The BIRN system [15] 
extracts/retrieves and then transmits images from a 
source, while caBIG manages oncology and radiology 
images from multiple sources through its web servers 
[14]. These systems encourage and foster collaboration 
among individuals and research groups. The Allen 
Mouse and Human Brain Atlas [8] provide an interactive, 
genome-wide image database of gene expression as a 
web-based resource to present a comprehensive online 
platform for the exploration of mouse and human brain 
research [8]. The BrainMaps is an NIH-funded project 
that provides an online interactive zoomed high-resolu- 
tion digital brain atlas of massive scanned brain structure 
images in both primate and non-primate serial sections 
for research and didactic settings [16]. 

The Mouse Brain Library (MBL) provides massive 
image collections of mouse brain structure that consists 
of the most comprehensive sets of recombinant inbred 
strains, including BXD, LXS, AXB, BXH, and CXB for 
studies of the genetic control, function, and behaviour 
[9]. The Web Quantitative Trait Loci (QTL) provides a 
collection of images from 200 well defined strains of 
mice, 2200 cases, 8800 Nissl-stained slides, and about 
120,000 coronal and horizontal sections of the mouse 
brain to support deep understanding on the genetic 
variability axis [17]. The Surface Management System 
(SuMS) database consists of large numbers of complex 
surface-related datasets of the cerebral cortex that many 
believed to be human functions which provide connec- 
tivity for learning, emotion, sensation, and movement 
[18]. The neuroscience community can access SuMS for 
searching and federating meta-information across all the 
datasets using a Web interface version (WebSuMS). The 
Gene Expression Nervous System Atlas (GENSAT) data- 
base is sponsored by National Institute of Neurological 
Disorders and Stroke (NINDS) [19]. The GENSAT is 
well-constructed and positioned to act as a database 
that contains large collections for a gene expression 
atlas of the central nervous system of the mouse based 
on bacterial artificial chromosomes (BACs) [20]. 

The National Cancer Institute of "National Biomedical 
Imaging Archive" (NBIA) provides an online image 
repository tool to access imaging resources that aims to 
improve the use of imaging in increasing the efficiency 
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of imaging cancer detection, diagnosis, therapeutic 
response, and improved clinical decision-making sup- 
port [21]. Notable projects using NBIA database are the 
Reference Image Database to Evaluate Response 
(RIDER) [22], Lung Image Database Consortium (LIDC) 
[23] and Virtual Colonoscopy Collection [13]. The 
RIDER is a collaborative pilot project sponsored by NCI 
that provides a resource of full-chest DICOM CT 
scanned images for the patient response to therapy in 
lung cancer treatment. The LIDC provides a Web acces- 
sible platform that consists of low-dose helical CT scan 
collections for lung cancer in adult patients. The Inter- 
national Consortium for Brain Mapping (ICBM) is a 
web interface for an anatomically-labelled brain database 
sponsored by NCI [16]. The neuro-imaging and related 
clinical data can be accessed by searching through a 
user-friendly environment called LONI Image Data 
Archive (IDA) [24]. The IDA is currently used for many 
neuroscience projects across North America and Europe 
for Magnetic Resonance Imaging (MRI), Positron Emis- 
sion Tomography (PET), Magnetic Resonance Angio- 
gram (MRA) and Diffusion Tensor Imaging (DTI). 

Finally, several image-data management techniques 
with varying levels of complexity are available to related 
clinical researchers. The Java-based remote viewing sta- 
tion JaRViS was an early example of a medical image 
viewing and report generating tool that exploited local- 
area network systems for web-based image processing of 
diagnostic images generated through nuclear medicine 
[13]. Kalinski et al. introduced virtual 3D microscopy 
using JPEG2000 for the visualization of pathology speci- 
mens in the Digital Imaging and Communications in 
Medicine (DICOM) format to create a knowledge data- 
base and online learning platforms [25]. Kim et al. pro- 
posed the Functional Imaging Web (FlWeb) [26]. The 
FlWeb is a web-based medical image data processing 
and management system that uses Python and Java- 
Script for rendering a graphical user interface (GUI); it 
also uses Java Applets for development of online image 
processing functions. The creation of a massive data- 
image collection with bioinformatics functionality elimi- 
nates the problem that is encountered in querying and 
searching a large image set of data. It also enhances 
data transmission and collaboration. 

Methods 

Pivot Collection Requirements 

The minimum set of requirements needed to run Pivot 
collections is as stated below: 

♦ Collection.cxml -The collection extensible markup 
language (XML) consists of a set of rules to describe 
structured data to be displayed in Pivot collection. 



The CXML file contains the set of categories and 
types associated with it. The types are String, Long- 
String, Number, Date, Time and Link that describe 
the majority of the information associated with the 
individual images in the collection. The automation 
of a process in generating CXML file from a given 
set of images is described in section 3.1. 

♦ Collection.xml -This XML contains the unique set 
of identifications (ID) and the size of images (i.e., 
width and height) that are assigned to an individual 
image along with zoom levels of information. Collec- 
tion.xml is automatically created when we run the 
"deepzoom" function in python to subdivide high- 
resolution images into various zoom levels. The 
"deepzoom" function in python can be downloaded 
at [27]. 

♦ Python Imaging Library (PIL) [28] - PIL provides 
image processing functionality and supports many 
file formats. We adopted the PIL version 1.1.6 
(python-imaging- 1.1.6) to work in conjunction with 
Python Deep Zoom Tools for Pivot collections. 

♦ Python Deep Zoom Tools -The deep-zoom-tool 
version 0.1.0 [27] was adopted to run subdividing 
high-resolution image into various zoom levels 
described in Collection.xml. 

♦ Collection.html -This hypertext mark-up language 
(html) file contains necessary information to run the 
Silverlight browser capability to display high-resolu- 
tion image collections in client user browser. 

♦ Silverlight.js -The JavaScript file uses Silverlight 
browser capability. It can be downloaded from 
Microsoft Live Labs Pivot website [5]. 

♦ PivotSimpleDemo.xap -This is a compiled file for- 
mat that renders the graphical user-friendly interface 
(GUI). It is a Microsoft Silverlight [5] application 
that was developed in-house which is used as a Pivot 
viewer. 

♦ Collection files -This contains a set of image col- 
lections in various zoom levels indicated as dzi for- 
mat. The dzi format is the deep zoom file format 
obtained from Python Deep Zoom Tools. 

Automation of the Process of Creating Collection XML 
(CXML) 

We have automated the process of creating collection 
XML (CXML) with the use of XmlWriter Class [29] 
which is written in Hypertext Pre-processor (PHP). The 
PHP is a dynamic language that is a widely-used for 
Web interface and application development purposes. 
The structure of CXML is quite simple. Below example 
(See Figure 1) specifies a simple collection with the only 
one item. 
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<?xml version="1.0"?> 
Collection Mame="Heiio World Collection 
— F acetCategori es> 

<FaceOtegory Name=-Gene Name" Type= ,t String 4 '/> 
— *<if acetCateg ories > 

cltems imgBase- 'collection filesvcollection xmr> 
<ltem img^'W ld= M Cr HreNTittp //mouse brain-map, org 1 
< Description > Gene expression: DAB2</Description> 

< Facets > ^ . /"\ 

<Facet Name =" Gene name w > f*J 
<String Value- ur 
</Facet> 
</Fac€ts> 
</item> 
/items > 
</Collediom 





Name="DAB2" 




1. Top-level container for 
Pivot collections 

2. A grouping of facets 
which appears in the 
Pivot filter panel and in 
the info panel (e.g., gene 
name, gene type) 

3. This is a container for all 
images in the Pivot 
collection, 

4. This is a container for a 
value on an image that 
can be used to filter, sort, 
and display information 
about an image. 



Figure 1 Example of a simple collection with only one item (DAB2 gene). Example of a simple collection with only one item (DAB2 gene). 



Pivot Collection Architecture 

The Pivot collection architecture is comprised of two 
main components that provide the ability to create a set 
of dzi formats from a large number of high-resolution 
images through the DeepZoom 

Tier and view Pivot image collections using the Web 
Application Tier (See Figure 2). The image collections 
are stored in the image database. 

DeepZoom Tier 

The DeepZoom Tier is responsible for detecting a col- 
lection request from a user through the web application 
tier. It is used in processing, creating, and hosting the 
pivot collections. The technologies necessary to run the 
DeepZoom Tier includes PHP, Python efficiently config- 
ured with Python Image Library, MySQL database, 
deep-zoom library and the Apache web-server. 
The deep-zoom server is comprised of two main 
components that provide the ability to work as an 




Figure 2 Modify Pivot Collection Architecture. A modified 
architecture showing Pivot collections. 



intercommunicating autonomous system and perform 
their respective functions. 

♦ Migration and structuring component: This compo- 
nent fetches one pivot-request file each time from the 
pivot request queue, migrates the images from the 
image datasets and creates the appropriate file struc- 
ture required by the pivot collection (See Figure 3). 

♦ The Pivot creation, verification and hosting com- 
ponent is composed of four main steps as stated 
below: 

1. Fetch one pivot-request from the pivot-inter- 
queue. 

2. Create deep-zoom image partitions. 

3. Create CXML, XML file and the appropriate 
directory structure required for pivot hosting 
specifications. 

4. Host the collection on the local web-server 
(See Figure 4). 

The fundamental function of the DeepZoom Tier is to 
accept the client-side requests and then generate a set 
of dzi formats along with information about zoom levels 
from a given set of high-resolution images. The commu- 
nication channel (C2) between the Web Application 
Tier and DeepZoom Tier uses Asynchronous JavaScript 
and XML (Ajax). In this case, we use Ajax to link PHP 
and Python Deep Zoom. To run python on Apache, we 
have to map common gateway interface (CGI) file 
extensions to handlers by un-commenting "AddHandler 
cgi-script.cgi" under the "httpd.conf" file. The "httpd. 
conf" is the Apache configuration file. We set the per- 
missions of the root directory, or the directory which 
contains the python files (See Figure 5). 
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start 



Wait 



check for pivot 
request in pivot-queue 



NO 




Fetch the request file 
images and create 
directory structure 



Update pivot-inter- 
queue 



Figure 3 Modify Apache directive to handle deep zoom script 
in python. Flowchart showing the migration and structuring 
component of the algorithm. 



Web Application Tier 

The Web Application Tier consists of three main com- 
ponents that provide an interface to the mouse brain 
image database, python imaging library and deep zoom 
tools. The components are as follows; 

♦ The Application Programming Interface (API). The 
purpose of the API is to enable the user to cluster 
and classify mouse brain images for a given gene 
symbol from the Allen Brain Atlas (see Figure 6). 
The communication channel (CI), which is the web 
browser, provides users with the ability to conduct 
real-time searches of related research images for 





start 















vv 
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dll 
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check for request in 
pivot-inter-queue 



NO 




Initiate deep-zoom 
sequence 



Verify output 
structure 



Host the collection 
and update status 



Figure 4 Migration and structuring component flowchart. 

Flowchart showing Pivot creation, verification and hosting. 



Pivot collection. As depicted in Figure 6, the set of 
mouse brain images in the sagittal view are retrieved 
through gene symbol query submitted by the user. 
These images represent gene expression maps for 
the mouse brain using high-throughput procedures 
for in situ hybridization. 

♦ Creation of Pivot image collections from a given 
set of images. The Web Application Tier sum- 
marizes the total set of images and sends them to 
the DeepZoom Tier. The dzi generation process 



Q 



/ 



<Oirectory \ /Apache/htdocs> 



\ 



Options ExecCGI ^~ 
Orderallow,deny 
Allow from all 



SeiHandler cgi-script — ^Tj 
</Directory 



1 . Directive in httpd.conf 

2. Allow Apache to run python 

3. Allow Apache to handler the cgi-script 

The option 2 and 3 are important which will help 
rendering python on Apache, To take effect, we have to 
restart the Apache server 



Figure 5 Pivot creation, verification and hosting component. Apache directive that handles deep zoom script in python environment. 
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takes place in queue scheduling priority. Once the 
process is completed, the DeepZoom Tier will be 
sending an auto response back to the Web Applica- 
tion Tier. A Linux CentOS 5 2-dual core 2.26 GHz 
Xeon processors (Dell R610) running Apache was 
deployed to run the process of dzi generation at the 
DeepZoom Tier. We intend to implement this on a 
Linux Centos 5 8-core system (Dell R610) to handle 
higher volume of collection requests. 
♦ Display Pivot image collections. The Pivot image 
collections are shown in Figure 7 and Figure 8. 



14 shows the classification of the genes based on the 
chromosomes metadata, while Figure 15 shows that 
genes that are extracted when the sorting is based on 
gene name, chromosome (4, 9 and X) and at age 55. 
The same can be used to answer more complicated 
questions by applying more filters and conditions 
visually on the collection. And online access to the 
mouse brain pivot collection can be viewed using the 
link http://edtech-dev.uthsc.edu/CTSI/teeDevl/unittest/ 
PaPa/collection.html (user name: tviangte and password: 
demome) 



Results 

We used a total of 1087 genes [8] that are co-expressed 
with Dab2 from the Allen Brain Atlas [8]. The different 
categories of genes used include gene name (see Table 1 
below) and meta information for imageseriesid, plane, 
sex, age, treatmenttype, strain, specimenid, riboprobe- 
nam, probeorientation, position, imagedisplayname, 
referenceatlasindex to classify and cluster the genes to 
identify the hidden information relating to each of the 
meta information columns. Figure 9 shows a broad view 
of all the genes used. Figure 10 shows the classification 
of the gene based on the gene name. Figures 11, 12 and 
13 show the images of expressions filtered with their 
associated data using Abcal, Klhdc8b and Pdlim4 genes 
respectively in the sagittal plane as a case study. Figure 



Discussion 

This article has shown that with the necessary require- 
ments like massive lab-imaging, an automated process 
for Pivot image collections can be generated. The Pivot 
collection is used in the clustering and classification of 
visual data into various deep zoom levels to detect the 
underlying hidden patterns within the data sets. This 
resource gives the user dynamic predictive ability with 
regard to the data items and also serves as a visualiza- 
tion tool. However, there are some limitations of the 
current Pivot implementation. The image analysis would 
typically be used by scientists and clinicians to examine 
research hypotheses that are defined with this current 
Pivot collection technology. There is also the expansion 
of the Analysis Tier to implement analytical and statisti- 
cal functions which extracts meaningful information 
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Figure 6 Pivot Collection Architecture. Pivot collection architecture using the mouse brain gene as an example. 
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Figure 7 Pivot collections. A. Forest view of gene expression maps for 1 1 gene symbols. B. Tree view of gene expression maps for 4 
gene symbols. Pivot collections. A. Forest view of gene expression maps for 1 1 gene symbols. B. Tree view of gene expression maps for 4 
gene symbols. 





Figure 8 Pivot collections C. Gpd2 -gene expression map in hippocampus (coronal view). D. Progressively deep zoom level (Gpd2). 

Pivot collections C. Gpd2 - Gene expression map in hippocampus (coronal view). D. Progressively deep zoom level (Gpd2). 
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Table 1 Names and number of genes 


Gene Name 


Number 


1810009M01Rik 148 


2610528K1 1 Rik 


16 


543041 9D17Rik 18 


A230063L24KIK 


1 7 


Abcal 


20 


Agtrap 


1 7 


BC054438 


74 


Calu 


16 


Cd63 


1 1 9 


Col8a2 


1 9 


Dffb 


17 


Dhx15 


19 


Efs 


19 


Enpp7 


19 


Gprin2 


20 


Hcn4 


17 


led 1 


17 


Kif13b 


25 


Klhdc8b 


20 


Limdl 


17 


Lrch2 


20 


MagealO 18 


Mybl2 


12 


Notch2 


19 


Pak4 


20 


Pdlim4 18 


Phkal 


19 


Rbm27 


16 


Sec24b 18 


Slc22a8 80 


Slc7a11 


20 


SmpdBa 


20 


Sod3 


76 


Ss18 


20 


Tiel 


20 


Timd2 


17 


Xtrp3s1 


20 


Total 


1087 


Names of the 1087 genes used in the Pivot collection analysis. 



from Pivot collections such as an image marker to iden- 
tify and share ROI, image feature comparison, and built- 
in basic statistical functions (e.g., t test, ANOVA, Corre- 
lation matrix) or an interface to statistical or imaging 
processing tool boxes (e.g., MATLAB), which will ulti- 
mately benefit the research community. 









■'I'""" 













Figure 9 Gene's Broad view. Broad view of genes pivot collection. 

V J 



The middle layer service was written in PHP [30] to 
retrieve massive data sets along with metadata in XML 
format [31] from the Allen Brain Atlas using their pro- 
prietary API. The Pivot analysis provides users with 
ability to split, filter, and sort constraint variables and 
allows browsing at the different conceptual levels 
enabling mining processes, such as a decision tree 
mining. This enhances a key challenge to delivery of 
massive image collections of high-resolution images 
such as the Allen Brain Atlas projects and BrainMaps 
[16]. Pivot's user interface provides users with fast, 
interactive and intuitive online technologies to swiftly 
answer multi-dimensional, analytical queries from 
Pivot collections and support just-in-time resources 
and tools to be used to test the research hypotheses. 
For example, Figure 12 shows how genes are expressed 
in Chromosome 11 at any given age of the gene. 
Unlike other machine learning algorithms [32-34], 
which are restricted to a particular domain, our pro- 
posed algorithm/method can be applied to other bio- 
medical domains like biomedical literature (text 
mining) and also within the geo-spatial domain [35] 
etc. In literature mining the text is converted into 
matrices to express words in sentence, while in geo- 
spatial mining, our algorithm can be used to identify 
the location (including the coordinates), age and sex of 
a given population; thus this is an algorithm with mul- 
tiple functions and applications. With our ongoing 




Figure 10 Classification of genes. Pivot collection showing 

classification of genes, 
k ) 
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Figure 11 Abcal gene. Broad view of Abcal gene in the Pivot 
collection. 



development, we have no intention of reinventing 
advanced statistical software packages, such as SAS 
and image processing tools such as those in the 
MATLAB toolbox. On the contrary, we will develop a 
seamless interface to bridge Pivot collections with 
those software packages. We will provide users the 
ability to pre-process their biomedical images and 
neuro-imaging and then isolate extracted datasets for 
external examination (e.g., image-mining). 

Conclusions 

Clinicians, laboratory researchers and other health-care 
providers generate a massive amount of neuro-imaging 
and clinical images in high resolution on daily basis; in 
fact, there is an "Information overload" [32] with the 
image collections being generated. Furthermore, the 
most common internet browsers now have improved 
browsing capabilities and network performance with 
respect to retrieving and downloading/uploading infor- 
mation from the internet/intranet or other online 
media. Thus it has become imperative for investigators 
of clinical projects to improve their method of image 
collection, to keep abreast and leverage the latest tech- 
nology development in relation to image collection, 
collaboration, storing and transmission. Microsoft Live 
Labs Pivot technology empowers investigators of clini- 
cal research projects to interact with a massive amount 
of image collections seamlessly, thereby enabling them 
to analyze, filter, collaborate and share the image 




Figure 12 Klhdc8b gene. Broad view of Klhdc8b gene in the Pivot 
collection. 

v / 




Figure 13 Pdlim4 gene. Broad view of Pdlim4 gene in the Pivot 
collection. 

v. / 



collections. The visual enhancement in the Microsoft 
Pivot technology makes data extraction and filtration 
very easy to use. We have proposed an automated pro- 
cess that will enable Microsoft Live Labs Pivot technol- 
ogy to create large sets of image collections. 

Compared with any other data analysis technique 
like data mining, knowledge management and discov- 
ery, information and cognitive reasoning [34,36,37]; 
Azuaje [34] predicts cluster number from a given col- 
lection set without applying visual analytic component 
to the prediction making it difficult to identify the hid- 
den information from the collection. Barbarane [36] 
evaluates cluster analysis solutions without identifying 
the hidden information in a data collection set. Badu- 
lescu [37] reviewed data mining algorithms used in 
clustering a data set collection but all the algorithms 
reviewed by Badulescu [37] does not visually analysed 
the data set to detect the hidden data/information. 
Pivot collection's visual analytics is used in identifying 
unexpected hidden information or data like strains of 
features that have been expressed in a gene database 
as shown in Figures 11, 12 and 13. Thus Pivot visual 
analytics is used in analyzing hard complex problems 
that other machine learning algorithms would other- 
wise find it difficult to analyze. It relieves the user 
from complex and sometimes complicated mathemati- 
cal and statistical formulas associated with other 
machine learning methods and algorithms [33]. Our 




Figure 14 Classification of genes based on chromosome 

metadata. Pivot collection showing classification of genes based on 

chromosome metadata, 
k ) 
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proposed algorithm has automated the creation of 
large image Pivot collections; this will enable investiga- 
tors of clinical research projects to easily and quickly 
analyze the image collections with a view that is criti- 
cal for making decisions about the image patterns 
discovered. 
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